A Relational Operator Approach to Data Fusion
نویسنده
چکیده
Integrated information systems provide users with a single unified view to heterogeneous data sources. As the resolution of schema level conflicts and the detection of fuzzy duplicates has been looked at more comprehensively, the problem of resolving data level conflicts still remains. We propose a relational data fusion operator, which fuses tuples representing the same real world entity by resolving conflicts in the attributes. Syntax and semantics of the operator are given as well as an extension of Sql. Furthermore, optimization issues involving transformations of logical query plans involving fusion and enabling cost based optimization for fusion are addressed. An implementation of the operator as part of a research prototype is under way.
منابع مشابه
A New Approach to Self-Localization for Mobile Robots Using Sensor Data Fusion
This paper proposes a new approach for calibration of dead reckoning process. Using the well-known UMBmark (University of Michigan Benchmark) is not sufficient for a desirable calibration of dead reckoning. Besides, existing calibration methods usually require explicit measurement of actual motion of the robot. Some recent methods use the smart encoder trailer or long range finder sensors such ...
متن کاملExtending Relational Algebra to express one-to-many data transformations
Application scenarios such as legacy-data migration, ETL processes, data cleaning and data-integration require the transformation of input tuples into output tuples. Traditional approaches for implementing these data transformations enclose solutions as Persistent Stored Modules (PSM) executed by an RDBMS or transformation code using a commercial ETL tool. Neither of these solutions is easily m...
متن کاملDeclarative Data Fusion - Syntax, Semantics, and Implementation
In today’s integrating information systems data fusion, i.e., the merging of multiple tuples about the same real-world object into a single tuple, is left to ETL tools and other specialized software. While much attention has been paid to architecture, query languages, and query execution, the final step of actually fusing data from multiple sources into a consistent and homogeneous set is often...
متن کاملMax-Min averaging operator: fuzzy inequality systems and resolution
Minimum and maximum operators are two well-known t-norm and s-norm used frequently in fuzzy systems. In this paper, two different types of fuzzy inequalities are simultaneously studied where the convex combination of minimum and maximum operators is applied as the fuzzy relational composition. Some basic properties and theoretical aspects of the problem are derived and four necessary and suffi...
متن کاملA COGNITIVE STYLE AND AGGREGATION OPERATOR MODEL: A LINGUISTIC APPROACH FOR CLASSIFICATION AND SELECTION OF THE AGGREGATION OPERATORS
Aggregation operators (AOs) have been studied by many schol- ars. As many AOs are proposed, there is still lacking approach to classify the categories of AO, and to select the appropriate AO within the AO candidates. In this research, each AO can be regarded as a cognitive style or individual dierence. A Cognitive Style and Aggregation Operator (CSAO) model is pro- posed to analyze the mapping ...
متن کامل